CDS

Accession Number TCMCG075C29222
gbkey CDS
Protein Id XP_017984785.1
Location join(2273883..2273963,2274490..2274576,2274756..2274843,2274937..2275028,2275143..2275207,2275290..2275378,2275654..2275731,2275822..2275915,2276009..2276075,2276160..2276223,2276313..2276353,2276569..2276754)
Gene LOC18586130
GeneID 18586130
Organism Theobroma cacao

Protein

Length 343aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018129296.1
Definition PREDICTED: UDP-glucuronic acid decarboxylase 5 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category GM
Description udp-glucuronic acid decarboxylase
KEGG_TC -
KEGG_Module M00361        [VIEW IN KEGG]
KEGG_Reaction R01384        [VIEW IN KEGG]
KEGG_rclass RC00508        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K08678        [VIEW IN KEGG]
EC 4.1.1.35        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00520        [VIEW IN KEGG]
ko01100        [VIEW IN KEGG]
map00520        [VIEW IN KEGG]
map01100        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGTCTGCAAACGGTGATCACAACTCAGCTTCTAAGAAGCCTCCGAGTCCATCTCCTTTGAGATTTTCCAAGTTTTTCCAGTCCAACATGAGAATTCTGGTTACTGGAGGAGCTGGATTCATTGGCTCCCACCTAGTGGACAAGTTGATGGAGAATGAAAAGAATGAGGTTATTGTTGTTGATAACTTCTTTACTGGCTCCAAGGACAACCTAAGGAAATGGATTGGGCATCCAAGATTTGAACTTATTCGTCATGATGTAACTGAGGCATTGCTAGTTGAGGTTGATCAGATATACCATCTTGCTTGCCCAGCTTCTCCAATTTTCTACAAATACAATCCTGTGAAGACAATAAAGACAAACGTGATTGGTACATTGAATATGTTGGGACTTGCAAAGCGTGTTGGAGCAAGGATTTTGCTTACGTCAACTTCAGAGGTATACGGAGATCCACTTGAGCATCCCCAGACTGAGAGCTATTGGGGCAATGTTAACCCAATTGGAGTTAGGAGCTGCTATGATGAGGGAAAACGAGTGGCTGAAACTTTGATGTTTGATTACCATAGGCAGCATGGCATAGAGATTCGGATTGCTAGAATTTTCAACACTTATGGACCACGCATGAATATTGATGATGGTCGTGTTGTCAGCAATTTCATAGCCCAAGCAATCCGTAATGAGCCTTTGACTGTTCAATTACCTGGAACACAGACAAGGAGTTTCTGTTATGTCTCAGATATGGTTGATGGCCTTATTCGACTTATGGAAGGAGAGAACACTGGGCCAATCAATATTGGGAATCCAGGTGAATTCACAATGCTCGAACTTGCAGAGGCTGTGAAGGAGCTTATCAATCCTGAGGTGCAAATATCCATGGTTGAAAACACTCCTGATGATCCTCGCCAGAGGAAGCCAGACATAACCAAGGCAAAGGAGCTGCTAGGATGGGAACCAACTGTCAAATTGCGCGATGGACTTCCTCTTATGGAGGAAGATTTCCGTCAGAGGCTTGGGGTATCCAGGAAGAACTGA
Protein:  
MSANGDHNSASKKPPSPSPLRFSKFFQSNMRILVTGGAGFIGSHLVDKLMENEKNEVIVVDNFFTGSKDNLRKWIGHPRFELIRHDVTEALLVEVDQIYHLACPASPIFYKYNPVKTIKTNVIGTLNMLGLAKRVGARILLTSTSEVYGDPLEHPQTESYWGNVNPIGVRSCYDEGKRVAETLMFDYHRQHGIEIRIARIFNTYGPRMNIDDGRVVSNFIAQAIRNEPLTVQLPGTQTRSFCYVSDMVDGLIRLMEGENTGPINIGNPGEFTMLELAEAVKELINPEVQISMVENTPDDPRQRKPDITKAKELLGWEPTVKLRDGLPLMEEDFRQRLGVSRKN